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The Minority Game: an introductory guide 



Esteban Moro* 



Abstract 

The Minority Game is a simple model for the collective behavior of agents in an idealized situation 
where they have to compete through adaptation for a finite resource. This review summarizes the 
statistical mechanics community efforts to clear up and understand the behavior of this model. Our 
emphasis is on trying to derive the underlying effective equations which govern the dynamics of the 
original Minority Game, and on making an interpretation of the results from the point of view of the 
statistical mechanics of disordered systems. 



1 Introduction 



There are 10 stars in the galaxy. That used to be a huge nuinber But 
it's only a hundred billion. It's less than the national deficit! We used to 
call them astronomical numbers. Now we should call them economical 
numbers. 
Richard Fevnman 



During the last years, the statistical mechanics community has turned its attention to problems in 
social, biological and economic sciences. This is due to the fact that in those sciences, recent research 
has focused on the emergence of aggregates like economic institutions, migrations, patterns of behavior, 
etc., as a result of the outcome of dynamical interaction of many individual agent decisions (2| 1421 
1621 1551 . The differences of this approach with traditional studies are twofold: firstly, the fact that 
collective recognizable patterns are due to the interaction of many individuals. On the other hand, in 
the traditional neoclassical picture, agents are assumed to be hyperrational and have infinite information 
about other agents' intentions. Under these circumstances, individuals jump directly into the steady 
state. However individuals are far from being hyperrational and their behavior often changes over time. 
Thus, new models consider explictly the dynamical approach towards the steady state through evolution, 
adaptation and/or learning of individuals. The dynamical nature of the problem poses new questions like 
whether individuals are able to reach a steady state and under which circumstances this steady state is 
stable. 

The problem is thus very appealing to statistical mechanics researchers, since it is the study of many 
interacting degrees of freedom for which powerful tools and intuitions had been developed. However, 
the dynamics of the system might not be relaxing and the typical energy landscape in which steady states 
are identified with minima of a Lyapunov function so that the dynamical process executes a march which 
terminates at the bottom of one of the valleys could not be applicable. The situation is reminiscent of 
other areas at the borderline of statistical mechanics like Neural Networks 1 1 1 where sometimes is not 
possible to build up a Lyapunov function which the dynamics tend to minimize. In fact, as we will see 
below there are strong analogies of the model considered here with that area of research. 
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The model I consider here is called the Minority Game (MG) 1121 which is the mathematical for- 
mulation of "El Parol Bar" problem considered by Brian Arthur fS^. The idea behind this problem is the 
study of how many individuals may reach a collective solution to a problem under adaptation of each 
one's expectations about the future. As the models mentioned before, the MG is a dynamical system of 
many interacting degrees of freedom. However, the MG includes two new features which make it differ- 
ent: the first one is the minority rule, which makes a complete steady state in the community impossible. 
Thus, dust is never settled since individuals keep changing and adapting in quest of a non-existing equi- 
librium. The second ingredient is that the collectivity of individuals is heterogeneous, since individuals 
have different ways to tackle available information about the game and convert it into expectations about 
future. Effectively, the minority rule and heterogeneity translate into mean-field interaction, frustration 
and quenched disorder in the model, ideas which are somehow familiar to disordered systems in con- 
densed matter 1 571 1611 . Actually, the MG is related to those systems and some of the techniques of 
disordered systems can be applied to understand its behavior. 

At this point, I hope the reader is convinced that the MG is an interesting problem from the statistical 
mechanics point of view: is a complex dynamical disordered system which can be understood with 
techniques from statistical physics. In fact, most of the research about the MG problem has been done 
inside the physics community. However, the El Parol bar problem originates in the economic literature, 
although it represents a fierce assault on the conventions of standard economics. In this sense either 
the El Parol bar or the MG are interesting from the economic and game theory point of view as well. 
My intention here is to present the statistical mechanics approach to this problem and to explain how 
different concepts of economists and game theory translate into physics terminology. This biased review 
is the author's effort to explain to physicists what is known about the MG. I make no attempt to be 
encyclopedic or chronologically historical. 

2 The El Farol Bar problem 

EL FAROL, 808 Canyon Rd., Santa Fe: El Farol restaurant is where locals like to hang out. 
The roomy, somewhat ramshackle bar area has an old Spanish-western atmosphere, but you 
can order some fine Spanish brandies and sherries. On most nights you can listen to live 
flamenco, country, folk, or blues. Dancers pack the floor on weekend nights in suminer. 
Fodor's miniguide 

The El Farol Bar problem was posed as an example of inductive reasoning in scenarios of bounded 
rationality fj|. The type of rationality which is usually assumed in economics -perfect, logical, deduc- 
tive rationality - is extremely useful in generating solutions to theoretical problems |25 1, but if fails to 
account for situations in which our rationality is bounded (because agents can not cope with the com- 
plexity of the situation) or when ignorance about other agents ability and willingness to apply perfect 
rationally lead to subjective beliefs about the situation. Even in those situations, agents are not com- 
pletely irrational: they adjust their behavior based on the what they think other agents are going to do, 
and these expectations are generated endogenously by information about what other agents have done in 
the past. On the basis of these expectations, the agent takes an action, which in turn becomes a precedent 
that influences the behavior of future agents. This creates a feedback loop of the following type: 

Precedents — > Expectations 

\ / 
Actions 

Inductive reasoning assumes that by feeding back the information about the game outcome, agents could 
eventually reach perfect knowledge about the game and arrive to an steady state. On the contrary, de- 
ductive reasoning (which is commonly applied in economics, for example), assume that the precedents 
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contain full information about the game and then there is not any dynamical approach to the steady state, 
which is attained in a single step: 

Precedents (full information) — > Actions (steady state) 

The El Parol bar problem is posed in the following way: N people decide independently each week 
whether to go to a bar that offers entertainment on a certain night. Space is limited, and the evening 
is enjoyable if things are not too crowed -specifically, if fewer than aN with a < 1 of the possible 
N are present. In other case, they stay at home. Thus agents have two actions: go if they expect the 
attendance to be less than aN people or stay at home if they expect it will be overcrowded. There is 
no collusion or prior communication among the agents; the only information available is the numbers 
who came in past weeks. Note that there is no deductively rational solution to this problem, since 
given only the numbers attending in the recent past, a large number of expectational models might be 
reasonable. Thus, not knowing which model other agents might choose, a reference agent cannot choose 
his in a well-defined way. On the other hand, expectations are forced to differ: if all believe most will 
go, nobody will go, invalidating that belief. In order to advance the attendance next week each agent 
is given a fixed number of predictors which map the past weeks' attendance figures into next week. 
The idea behind those predictors is that when agents face complex scenarios like this they tend to look 
for patterns that occurred in the past and predict next move from that experience. Finally, each agent 
monitors his predictors by keeping an internal score of them which is updated each week by giving 
points or not to all of them whether they predicted the attendance or not. At each weak, each agent 
uses his predictor with bigger score. Computer simulations of this model [3J shows that the attendance 
fluctuates around aN in a {aN,{l — a)N) structure of people attending/not attending. Thus, predictors 
self-organize so that this structure emerges in the complex dynamics of the system. 

Despite the fact that the El Parol bar problem deals apparently with a non-market context it can be 
considered as a kind of very simple "proto-market" or "market toy model" I45ll48ll63l : at each time step 
agents can buy (go to the bar) or sell an asset. After each time step, the price of the asset is determined 
by a simple supply-demand rule: if there are more buyers than sellers, the price is high, and if there are 
more sellers than buyers, the price is low. If the price is high, sellers do well, while if the price is low, 
buyers win the round. Thus, the minority group always win, irrespectively of whether they were buyers 
or sellers. However, some of the behavioral assumptions on which the El Parol bar problem (and the 
MG model) is based may be questionable when applied to financial markets 1161 . Moreover, it still lacks 
some of the most basic features in a real market, e.g. the price which is determined by the aggregate 
supply-demand rule is not considered or the fact that agents do have different payoffs for their actions 
is not taken into account 1163 1 . The relation to the markets is then on the conceptual level, as markets are 
interpreted as an adaptative competitive system in which minority membership plays an important role. 
Many other complex systems are driven by this minority rule, like vehicular traffic on roads, in which 
each agent would prefer to be on an uncongested road 115 01 . packet traffic in networks, in which each 
packet will travel faster through lesser used routers [3 6] and ecologies of animals looking for food, in 
which individuals do best when they can find areas with few competitors [10 23 1. 

From the statistical mechanics point of view, we can say that in the El Parol bar problem, a system 
with N degrees of freedom has finally come to a stationary state in which the attendance fluctuates 
around aN. Some of the properties of the El Parol bar problem are very appealing to statistical mechan- 
ics, like: 

• Many {N 1) interacting agents. 

• Interaction is through the aggregate bar attendance, i.e. of the mean field type. 

• The system of N agents is frustrated, since there is not a unique winning strategy in the problem. 



3 



• Quenched disorder, since agents use different predictors to generate expectations about the future 
of the game. 

However, other features of the model are not usual in statistical mechanics, like the non-Markovian time 
dynamics induced by the fact that the predictors map past attendance into next week prediction which is 
released back to the agents. Moreover, the dynamics based on inductive reasoning may not be relaxing 
and, in principle, the steady state which the game attains could be out of equilibrium: there is no reason 
a priori to expect detailed balance or ergodicity in the system dynamics. At this stage it is not clear 
which of the main ingredients outlined before is relevant for the observed behavior Thus, it is necessary 
first to give a precise mathematical definition of the model and try to clear up what are the ingredients 
truly responsible for the model main features. 

3 The Minority Game 

Truth always rests with the minority, and the minority is always stronger than the major- 
ity, because the minority is generally formed by those who really have an opinion, while 
the strength of a majority is illusory, formed by the gangs who have no opinion-and who, 
therefore, in the next instant (when it is evident that the minority is the stronger) assume its 
opinion . . . while Truth again reverts to a new minority. 
Soren Kierkegaard 

In order to rationalize the discussion, Challet and Zhang 1121 gave a precise mathematical definition 
for the El Parol bar problem which they called the Minority Game (MG). In the MG the predictors are 
replaced by well defined strategies from a given state space and the available information used by the 
agents is accurately defined. 

In their definition of the MG model, the game consists of N (odd) agents playing as follows (see 
figure^: at each time step of the game, each of the N agents takes an action ai{t) with i = 1, . . . , A^: 
he decides either to go to the bar {ai{t) — 1) or to stay at home {ai{t) — —1). The agents who take the 
minority action win, whereas the majority looses. After a round, the total action is calculated 

N 
i=l 

The minority rule sets the comfort level at A{t) = 0, so that agent is given a payoff ~ai{t)g[A{t)] at 
each time step with g an odd function of A{t). Challet and Zhang's initial choice was g{x) = sign(x), 
but other analytical functions are more suitable for mathematical treatment like g{x) — x/N . Most 
of the MG properties are qualitatively independent of the precise analytical form of g{x) L44J . The 
information about the winning group is released to the agents which is given by W{t + 1) = s\giiA{t). 

The way agents choose their action ai{t) is by inductive reasoning: we assume that the players have 
limited analyzing power and they can only retain the last m winning groups. Moreover they base their 
decision ai [t) on these last m bits only. To this end they have a set of s > 2 strategies. A strategy is just 
a mapping from the sequence of the last m winning groups to the action of the agent, i.e. a table which 
tells the agent what to do as a function of the input signal (the last m winning groups). An instance of 
one of those strategies is given in figure^ 

Since there are 2™ possible inputs for each strategy, the total number of possible strategies for a 
given m is 2^ . At the beginning of the game each agent is given a set of s strategies randomly drawn 
from the total 2^ possible strategies. This assignment is different for each agent and thus, agents may 
or may not share the same set of strategies. Note that each strategy is determined by a 2™ dimensional 
vector r°' whose components are the output of strategy a of agent i. For example, in figure [2 the 
strategy is r^" = (—1, —1, +1, —1, —1, +1, —1, +1). If the last winning groups were —1, +1, +1, 
then the prediction for this strategy is given by the fourth component of r Thus, at each time step. 
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Figure 1: Left: An example of an strategy for m = 3. Right: Cartoon of the MG model for a given 
time step: in this case the strategy maps the last three winning groups (m = 3) into the agent decision. 
Solid thick Unes mimic how the information flows in the system: the N agents take the last m numbers 
(— 1, —1, +1 in this case) from the sequence of winning groups and perform an action accordingly. The 
N actions are transformed into the next winning group (+1 in this case) through the minority rule. This 
information is shared with the agents for their own feedback [Eq. and becomes the next number in 
the sequence of winning groups. 



the prediction of a strategy is given by its ^{t) e {1, . . . , 2™} component, where ^{t) is a number 
whose binary representation correspond to the last winning groups'. If we denote I{t) the vector whose 
components are zero excepting the component which is one, then the prediction of strategy r^" is 
given by r^" • I{t). For example, if the last three winning groups were —1, +1, +1 then = 4 and 
/(<) = (0, 0, 0, 1, 0, 0, 0). Thus, strategies are all possible 2™ -dimensional vectors with ±1 components. 

Adaptation comes in the way agents choose at each time step one of their s strategies: they take 
the strategy within their own set of strategies whose performance over time to predict the next winning 
group is biggest. In order to do that each agent i assigns virtual points pf{t) to his strategy a after each 
time step t when they predicted correctly the winning group: 

PUt + 1) = - ■ m g[Ait)] (2) 

where a = 1, . . . , s and i ^ 1, . . . , N. However these points are only virtual points as they record only 
agents' strategies performance and serve only to rank strategies within each agent set. After time t agent 
i takes the first strategy in his personal ranking which tells him what to do in the future. If we denote 
agent i best strategy at time t in his ranking as f3i{t) G {!,..., s},^ then his action at time t is given by: 

a, (i) = rf (3) 

The fact that this personal ranking can change over time makes the agents adaptative: the ranking of 
each agent's strategies can change over time and then Pi{t) could be different at different times. 

'in the binary representation of fi{t) we make the correspondence — 1 -s-^ and +1 ^ 1. Thus, if the last winning groups 
were —1, +1, +1 the binary representation of fj,{t) is Oil and fi{t) = 4. 

^When two strategies have the highest number of points, the best strategy is chosen by coin tossing. 
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Figure 2: Time evolution of the attendance for the original MG with g{x) — x and N — 301 and s — 2. 
Panels correspond to m = 2,7 and m = 15 from top to bottom. Periodic patterns can be observed for 
ni — 2 and m — 7. 



Finally, the information I{t) is updated by adding the last winning group: the only nonzero compo- 
nent of I{t + 1) is the + 1) one which is given by 1 14| 



^^it + 1) = [2^i{t) + {W{t + 1) - l)/2]modP, 



(4) 



were W{t + 1) = signA(i) is the winning group. 

Equations (|2} and Q together with the implicit ranking of each agent's strategies define the process 
of adaptation. Note that the minority rule is encoded in the function g{x) of the attendance and appears 
in the way public information I{t) is built and also when virtual points are given to the strategies. 
Finally, interaction between agents in eq. (|2ji is through the attendance A{t) which is of the mean-field 
type. The heterogeneity among agents shows up at the set of ff which could be different for different 
agents. 

An interesting point is to observe that the variables which define the state of the game at each 
time step are pf{t) together with the public information I{t). But, on the other hand, eq. © and 
introduce a non trivial feedback in the dynamics since the state of the system at time t depends on the 
last m winning groups. This is one of the characteristic features of the MG, since the dynamics given 
by eqs. (|2j-(0} is non-local in time. 



4 Coordination due to adaptation 

It is not worth an intelligent man's time to be in the majority. By definition, there are already 
enough people to do that. 
Godfrey Harold Hardy 

Initial studies of the MG model relied in simulations Jol 1121 [T3l 1451 l38l . Typical simulations of 
the MG in its original formulation g{x) — sign(x) are given in figure El As we mentioned in the 
introduction, the aggregate A{t) never settles down and it fluctuates around the comfort level, A{t) = 0, 
as observed by Arthur in his paper 1 3 1. Thus we have {A{t)) — where (• • •) is a time average for long 
times and is an average over possible realizations of r^". Despite its trivial mean value the possible 
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Figure 3: Volatility as a function of the control parameter a — 2'"/A^ for s — 2 and different number of 
agents N = 101, 201, 301, 501, 701 (□, 0, A, <l, V, respectively). Inset: Agent's mean success rate as 
function of a. 



values of A{t) display a nontrivial shape and the fluctuations are important f9]. Moreover for small 
values of m the attendance display time periodic patterns which are lost for large values of m. 

4.1 Volatility 

While the behavior of {A{t)) is somehow trivial, fluctuations of A{t) around its mean value given by 
the variance = {[A{t) — {A{t))]'^) have a more interesting behavior (see figure|3}. First note that a 
is related to the typical size of the losing group, so the smaller <t, the more winners are in the game. The 
variance is usually known as the volatility or (the inverse of) global efficiency. The behavior of <t^ 
as a function of the parameters of the model m, s and N shows a quite remarkable behavior 11431 1531 : 

• It was found by extensive simulations that a'^/N is only a function of a = 2™ /N for each value 
of s (see figure This finding not only identifies the control parameter in this model, a, but 
also paves the way for the application of tools of statistical mechanics in the thermodynamic limit 
N ^ 00. Since qualitative results are independent of s > 2 we take the simplest case s — 2 for 
the rest of the chapter. 

• For large values of a, a"^ /N approaches the value for the random choice game a"^ /N ~ 1, i.e., 
the game in which each agent randomly chooses ai {t) — — 1 or ) — 1 independently and with 
equal probability at each time step. 

• At low values of a, the average value of is very large, actually, it scales like /N ^ a^^ 
which means that a ^ N and thus the size of the losing group is much larger than N /2. 

• At intermediate values of a, the volatility a is less than the random case, and it attains a minimum 
value at a ~ 1/2. In this region, the size of the losing group is close to N/2 (which is the 
minimum possible size for the losing group). 
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The fact that a gets below the random case for a given interval of values of a suggests the possibility 
that agents coordinate in order to reach a state in which less resources are globally wasted |53 1. In the 
market formalism, this means that agents can exploit information available and predict future market 
movements, so that global efficiency ct^ is minimized. In this sense adaptation is a good mechanism 
for the agents to achieve a better solution to the MG problem for a = ac — 1/2. The solution comes 
through coordination among agents, which is contrary to their selfish intrinsic nature. Global coordina- 
tion as the better solution to a problem of competition for resources is common in different games and 
situations L25J . 

Note however that in the MG coordination is not complete and the best possible solution is not 
achieved: that in which agents alternate in groups of [N — l)/2 and {N + l)/2 which yields a"^ /N = 
1/N . In this situation the mean success rate, i.e. the frequency an agent is successful in joining the 
minority is 1/2. As shown in figure|3]the success rate in the MG is close to 1/2 around the minimum of 
/N, while it is substantially smaller in the region where the volatility is high. 

4.2 Information 

The fact that agents seem to exploit the available information led some authors to study the information 
contained in the time series of A{t). Specifically, it was found that W{t + 1) = s\g\iA{t) is independent 
of sequence of the last m attendances in the high volatility region, while there is a strong dependence 
for a > ac- From the market point of view, the region where a < ac is called efficient (although is 
socially inefficient), since the history of minority groups contains no predictive information for agents 
with memory m. On the contrary, in the region where a > ac there is significant information available 
to the strategies of the agents playing the game with memory m and the market is not efficient in this 
sense, since there are arbitrage possibilities-^ . But in either case A{t) is not a random sequence (see figure 
13 ||9]!45l. To quantify this behavior, it was proposed in |45 1 to measure the conditional probability of 
W{t + 1) knowing through the mutual entropic information of W{t) and W{t +1). Another more 
direct measure of this quantity was suggested in II4I as 



where the average of (i + 1) in (|5} is conditioned to the requirement that the last m winning groups 
are given by If there is no significant dependence between W{t + 1) and ^(i) then as we have 
akeady seen {W{t + 1)) = {signA{t)) = and H = Q. Loosely speaking H measures the information 
in the time series of A{t) available to the agents. In the market context, H ^ indicates the presence of 
information or arbitrages in the signal A{t). As seen in figure|3 H = for a < ac — 0.3 while H ^ 
for a > ac- When a ^ oo we recover H = since in that case the system of N agents behave almost 
randomly and we expect no correlations in the sequence of W{t). 

Despite the fact that H measures the information content in the sequence of A{t) it does it only 
through the correlation of the next winning group W{t + 1) with the last m winning groups. Moreover, 
H also measures the asymmetry of the response of the agents to the information available to them: when 
H ^ the system of N agents reacts differently to the available information, while when H — sW 
given information is treated equally by the agents. Thus, H has a twofold role in the game as a measure 
of the information available in the sequence of A{t) and the response of the system to a given piece of 
information /i(<). 

^The existence of arbitrage opportunities in a market implies that there exist price discrepancies of the same stock which can 
be exploit by agents to make profit by selling and buying them simultaneously. The market is efficient whenever these arbitrage 
opportunities are exploited instantaneously and therefore eliminated from the market. 




(5) 
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Figure 4: Information H (open symbols) and fraction of frozen agents (j) (full symbols) as a function 
of the control parameter a = 2"^ /N for s — 2 and to = 5, 6, 7 (circles, squares and diamonds respec- 
tively). 



4.3 Phase transition 

The fact that H = for a < ac and H ^ Q for a > — 0.3 suggests the possibility that there is a 
phase transition at a = ac which separates those two efficient and inefficient phases 1141 . Thus H is 
the order parameter of the system which measures the symmetry of W{t+ 1) in the system, a symmetry 
which is broken at a ~ ac- Since the phase transition occurs at a = ac, it means that for a fixed value 
of 771, the agents who enter the market exploit the A{t) predictability and hence reduce it. From the 
point of view of the community, ac separates two regions in which adaptation is successful in achieving 
a good global solution to the MG problem. The following table summarizes the different names for the 
phases given in different contexts: 



Context 


a < ac 


a > ac 


Volatility/ 
Global Waste 


Inefficient 
(Worse than random) 


Efficient 
(Better than random) 


Information/ 
Arbitrage in A{t) 


Efficient 
(no information, H ~ 0) 


Inefficient 
(arbitrage, H ^ 0). 



Note that the phase transition is both explained in terms of the information which is available to the 
agents and the minimization of the total waste, while only H is an order parameter. Other quantities have 
been identified as order parameters to explain the phase transition found numerically at a = ac- For 
example in 1 14| it was considered the fraction of agents (j> who always use the same strategy, i.e. those 
agents who have /3i{t) = /3i, independently of time. As we see in figure|3(/) is zero in the a < ac phase 
which means that all the agents change their strategy at least once. This is also the case for large values 
of a, since agents behave more or less randomly. However close to the transition point the fraction cj) is 
large and most of the agents stick to one of their strategies forever. In terms of (f> the phase transition can 
be understood as an "unbinding" transition, since for a < ac strategy points pf (t) remain very close to 
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each other and then agents can switch easily from one strategy to the other However, for a > Uc one 
of each agent's strategies wins in the long run and its virtual score pf{t) is much larger than the others. 

Finally, while there is no significant finite size corrections to the order parameter H, the fraction 
of frozen agents cj) seems to drop discontinuously to zero at ac when the number of agents is changed. 
Despite both order parameters indicate that the system undergoes a phase transition at a = ac it is not 
yet clear which is the nature of the phase transition (first, second order?) and whether it is an equilibrium 
or non-equilibrium phase transition. 

5 Spanning the strategy space 

Nobody goes there anymore. It's too crowded. 
Yogi Berm 1925-, American Baseball Player 

The universality of the behavior found in the last section for different number of agents, strategies 
or memory, suggests that it should depend on a generic feature of the game. Due to the minority rule, 
agents tend to differentiate: if they use the same strategy then all of them will lose. Thus, somehow 
the minority rule forces the agents to choose that strategy among theirs that makes them singular in the 
community. This translates into a repulsion of agents chosen strategy inside the strategy space. But 
the set of available strategies only contains 2^ strategies. Thus one should expect that agents are very 
different between them if the number of strategies is bigger than N, while they should behave more like 
a crowd if their strategy heterogeneity is small. In other words we should observe a change in the system 
behavior when iV ~ 2^ . However as we saw in the last section this change of behavior is observed at 
iV - 2™, i.e. a = 0(1). 

As note by various authors, while 2^ is the number of possible strategies, 2™ is the dimension of 
the space in which strategies live and thus there are only 2™ completely different strategies 1 13''32"381^. 
If agents tend to differentiate they can only do it by choosing those completely different strategies and 
this is only possible if iV < 2"\ For N > 2™ groups of agents with the same set of similar strategies 
emerge. Those agents end up using their same best strategy and become a crowd, since they react to the 
available information in the same way. The different behavior of jN can be understood in terms of 
this crowds I32II38I : 

• When iV < 2™, agents carry a considerable fraction of aU possible strategies. This means that 
for a large fraction of agents their best strategy /3j (t) is the same at time t. Then the size of this 
crowd is of the order of 0{K), which means that the variance is of order jN ^ A^. 

• On the other hand, when iV ^ 2™ the crowds will be low populated since the /3i(<) are almost 
different for any agent due to the high heterogeneity, which makes agents act independently. Thus, 
a7iV - 1. 

• Finally, for moderate values of a crowds form which share /?; (t) and with size of order of N . But 
at the same time there will be an anti-crowd which is a group of people which have the same best 
strategy but which is completely different to the one the crowd is using. Note that only at moderate 
values of a this is possible, since there is a nonzero probability that two groups of agents cannot 
share any of their strategies. In this case crowds and their anticrowds do completely the opposite: 
considering the extreme case in which the size of the crowd is iV/2 we have that jN ~ 0. 

More quantitative approximations to the size of the crowds and anticrowds are possible which reproduce 
the observed volatility behavior |38|. Another way to characterize the formation of crowds and/or 

"^Two strategies are said to be completely different if none of their components match. A quantitative measure of this is the 
Hamming distance, see Eq. 
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Figure 5: Distance of agents' best strategies as a function of a. Parameters used are iV = 51, s = 2 



anticrowds is to measure the distance between the best strategies of all the agents 1131 



where 1 1 • | |i is the Hamming distance (1-norm) in the 2™ -dimensional space of the strategies. As seen in 
figure|5] agents tend to differentiate between them and thus maximizing the distances between their best 
strategies. However, this is not possible for small values of a, since then agents are force by their limited 
memory to share their strategies. The situation is better for large a, since agents' set of strategies do 
not overlap and then the distance between them is random d ~ 1/2 II37I . However, for moderate values 
of a differentiation makes the system of agents to organize in groups with completely different best 
strategies. In the case the crowds and anticrowds are of size {N — 1) /2 and {N + l)/2 respectively, and 
knowing that the distance between two completely different strategies is 2™ we get d — N/[2{N — 1)] 
which is bigger than the random case. 

In summary, the observed behavior in the MG is the result of the minority rule in the strategy space: 
the minority rule imposes a repulsion of agents' chosen strategies at each time. However, the volume 
of the strategy space is fixed by m and thus repulsion is only effective when the dimension 2™ is much 
bigger than the number of strategies present in the game. Note that at intermediate values of a, repulsion 
of best strategies leads to coordination, since the optimal way to achieve differentiation is by joining an 
existing crowd or anticrowd. 



One of the main characteristics of the MG model is that agents adapt and learn from their past 
history to achieve a better personal record. As we saw in the previous section, the memory of the 
agents is crucial, because it determines the strategy space dimension and also because the memory is 
associated with the ability of agents to spot patterns in the available information so they can coordinate 
better. Somehow, the sequence of the winning groups contains information about the strategies of all the 
agents and having more memory gives an agent the possibility to exploit this information. For example. 




(6) 



6 Inventing the future 



The best way to predict the future is to invent it. 
Alan Kay 
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Figure 6: Left: Comparison of the model variance u^, information H (squares) and </) (circles) with 
endogenous information (open symbols) and exogenous information (full symbols). Right: Cartoon 
of the MG model using exogenous information. As in figure ^ solid thick lines mimic the flow of 
information in the system. In this case the winning group obtained through the minority rule is only 
used by the agents to rank their strategies and may differ from the one in the exogenous sequence of 
winning groups (as in the figure). 



for a fixed number of agents N, increasing their memory m could lead to better coordination and higher 
individual success. In the limit a ^ 1 however, more memory implies increasing complexity and 
random behavior. 

Since information is fed back into the system, early studies of the MG concentrated in the possibility 
that the system of N agents could exploit this information to achieve better coordination. However, 
A. Cavagna in |5| showed by means of extensive simulations that if the information which is 
given to the agents at each time step is chosen randomly and independently of time from its possible 
values ii{t) G {1, .... 2'"} the behavior of the MG remains the same regarding time averaged extensive 
quantities like the volatility, the information H or (p (see Fig.|6jl. The crucial point is that every agent 
reacts to the same piece of information whether this information is endogenous or exogenous, false or 
true. As a consequence, there is no room in this model for any kind of forecasting of the future based on 
the understanding of the past: coordination in the MG does not come through exploitation of available 
information in the sequence of the winning groups. 

Leaving aside the fact that changing the real sequence of for random numbers gives the same 
quantitative behavior for different quantities, this finding has significant importance for the theoretical 
understanding of the MG model. Specifically, equation Q drops from the definition of the model which 
is simply given by equations (|3 and ^ together with the ranking of each agent's strategies at any time 
step. In this sense, this results are useful in the theoretical understanding of the MG model, since it 
shows that the complicated feedback of the endogenous information in the system is just an irrelevant 
complication and thus, can be dropped in the way to explain the rich structure of this adaptative model. 

This fact does not mean that the memory parameter m is not relevant in the model: indeed, the 
model displays a phase transition at a given value of m for a fixed number of agents. However, from 
the results of A. Cavagna in 1 5 1 it is reasonable to conclude that the parameter m only relates to the 
dimension D ~ 2'" of the strategies space. Indeed, as we have just shown in section|5J the description 
of the phase transition observed in the MG can be just based on geometrical considerations about the 
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distribution of strategies in the space of strategies. 

As we can see in figure |6l the order parameter H does not change substantially when exogenous 
information is considered instead of endogenous one. But, as we said in section |4] H is related to 
correlations in the sequence of winning groups. Since the sequence of the winning groups is exogenous 
and random we expect in iJ = for all values of a. The fact that this is not true is due to the twofold 
role that H plays in the original MG: H is the response of the system to a given piece of information 
and does not depend on the origin of the information. But (see figure|S): 

• When endogenous information is considered, the response of the system to the available informa- 
tion does show up in the sequence of winning groups. 

• When exogenous information is considered, the response of the system is only considered to get 
the winning group and to keep record of the performance of the strategies and is not included in 
the sequence of the winning group. 

In either case, H ^ means that for a given piece of information, there is a possibility for agents to 
predict the response of the system. In this sense, memory is not irrelevant in the MG (see the discussion 
about this in Rll6ll54lV 

Despite the fact that the phase transition and macroscopic properties of the MG are the same with 
exogenous information, some properties of the MG are lost. As we show in figure|2]the attendance and 
the size of the minority groups are correlated in time and display time periodic patterns for a < a^.. 
This is lost when the information agents react to is exogenous as seen in figure although the typical 
size of the minority groups is the same and then a'^ /N is independent on the nature of the information 
used. Thus, any extension of the MG game which depends on the periodic nature of the bar attendance 
should give different results when exogenous information is considered l T7ll43ll64l . 

The fact that the memory parameter m is only relevant to determine the dimension of the strategies 
space led to the generalization of equation (|2} to consider general D-dimensional strategies and vectors 
I{t) for any value of D and not only for those with D = 2™, which is only an unnecessary complication 

CI 
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7 Maximizing the audience 



In section 0] we saw that agents adapt to achieve a personal better solution to the MG problem which 
in turn makes the total waste smaller than the random solution for a > ac- Thus, although agents are 
selfish and tend to place themselves into the minority group irrespectively of other agent's action, their 
dynamics tend to maximize the global efficiency, that is, to maximize the bar attendance. The concept 
that the system is trying to minimize a given quantity is very appealing to statistical mechanics. If this 
quantity exists, the system can be studied by considering its minima and perturbations around them. 

However, due to the minority rule, the MG never settles down: specifically, some of the agents keep 
on changing their strategies forever and nor Pi{t), neither pf{t) come to a rest^. Thus, there can not 
be any quantity based on these degrees of freedom that the MG dynamics tend to minimize. However, 
since the actions of the agents ai{t) depend on the points of the strategies and pf{t) depend on the past 
history of actions, one can wonder whether there is any time pattern in the long run that agents follow. 
Specifically, it might be that ai{t) never come to a rest, but mi{t) — J2t=o ^ii^) can converge when 
t ^ oo to a given quantity if agent i does follow any behavioral pattern. For example, agent i can be 
within the fraction (/> of people who always chooses one of his strategies and then rai{t) is the average 
of the possible outcomes of that strategy for all possible realizations of the information. This is the key 
point in the first attempt at solution of the MG model found in I15II47I . which we outline in this section. 

First of all, let us rewrite the equations of the MG at this stage. Since the observed behavior of the 
MG is independent of s > 2 we chose the minimal case s ~ 2. Then introducing <T,;(t) = s\gn[5pi{t)], 
where 5pi (t) = pj (t) — pj (t) is the difference of points between agent i two strategies, equation (|2j 
reads 

Sp,{t + 1) = Sp,{t) - 6 • m g[A{t)] (7) 

and 

N N 

where uji = {rl + r.^)/2 and Ci - ^;^)/2. With this notation, l3i{t) = {ai{t) + 3)/2. Another 

unnecessary complication is the binary payoff function g{x) = sign(a;) for the strategies. Thus, we 
focus on the linear case g{x) = x/D which allows for a simple treatment. In this case we have 

1 r ^ 1 

5p,{t + i)^ 6p, (t) m \Y.^^^^ (*)^^] • ^^(^) \ 

and one is allowed to neglect the fluctuations (law of large numbers) 

In this form, the authors in fl5" "471 argued that if the information is just a random number then 
in the long run each of the possible values of I{t) is visited with equal probability. Thus, in the limit 
N ^ oo and D ^ oo and making a temporal coarse-graining over time steps t = t/D that contain 
infinite number of updatings like Q, the right hand side of Q can be replace by its mean value over 
time L49J : 

6p, {t + T) = dp, {t) - • I ^ + (t) e.] I , (10) 

i=i 

where mi{t) = J2l'=t-T ™^ we have neglected fluctuations in favor of mean values according 

to the law of large numbers. Finally, making the approximation that the points at time t are related to 

'This can be observed in the order parameter </>, which is never equal to one. 
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a = 2"'/N 

Figure 8: Comparison of the results for the volatility in the original MG (circles) with the solution 
obtained for equation (I13> through the evaluation of the minima of Ti 



rrii (i) through the soft condition nii {t) = ta,nh[rSpi {t)] (where F is a constant) we obtain the following 
dynamical equation in the continuum approximation r ^ 



— = -2F(l-m,J 



r N 



N 



(11) 



This can be easily written as a gradient descent dynamics dmi/dr = — r(l — mf){dT-l/dmi) where 

n 2 



(12) 



Note that 7i is a positive function of nii and then H is a Lyapunov function of the dynamics given by 
il It . Making the same type of temporal coarse-graining we get 



N 



(13) 



This equations implies that the dynamics of the agents tends to minimize Ti. instead of cr^ and thus the 
properties of the MG are described by the ground states of H. It is easy to see that H is related to the 
order parameter H at least when H is very small, i.e. close to the phase transition. Then, adaptative 
agents tend to minimize the arbitrage/information rather than their collective losses a^. 

Taking into account the random character of uji and S^i, the Lyapunov function H is reminiscent of 
of the Sherrington-Kirkpatrick model |56| augmented by a random field. Ground state properties of H 
can be analyzed using standard tools from statistical mechanics of disordered systems. Averages over 
different realizations of the disorder and uji are obtained through the replica trick 1 15 47 1. Despite 
the approximations made to get to equations (II 1> and (I12> . the calculations showed that there is a phase 
transition at ac — 0.33740 and gave accurate predictions for the values of for a > ac (see figure 
|8}: in fact H — for a < ac while H ^ when a > ac- For a < ac however, the solution given by 
exploring the ground state of Ti has some mathematical problems and fails to reproduce the observed 
behavior. 
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The fact that the game for a > ac is described by the minimization of 7i shed some light also 
about the real nature of the MG: first of all, the dynamics tend to minimize a Lyapunov function which 
depends on the history of agents decisions instead of agents instantaneous decisions. And second, the 
function which agents tend to minimize has some disorder which is encoded in the "quenched" patterns 
and uoi. This last feature of the MG resembles the dynamics of attractor neural networks (ANN) 1 1 1 
where D patterns are stored through the Hebbian learning rule and the recurrent network retrieve the 
information stored in the neurons. The control parameter a — D/N measures then the ratio between 
the number of stored patterns D and the number of neurons N. In general, information retrieval in ANN 
is possible only under some conditions about the dynamics and a. This comparison is very appealing, 
since in principle agents in the MG would like to retrieve time patterns or other agents strategies to 
achieve a better personal record. Thus, is MG just an ANN of neurons trying to retrieve other agents' 
strategies? We will complete this analogy in the following sections. 



8 No news, good news 

The irrelevance of the information contained in the sequence of the winning groups found in section |6l 
poses a new question: how can it be that changing the model from being a closed system with endoge- 
nous information to an open one with exogenous information does not change its aggregate behavior? 
As we saw, the reason for that is that the aggregate behavior of the MG is encoded in the response of 
the system to the input information. Averaging this response over all possible values of the information 
will give us the mean response of the system and then we can update all strategies' points concurrently. 
For example, we found in sectionQthat coarse-graining the dynamics of the MG over many time steps 
leads to a coupled set of equations for p°'{t) in which the information I{t) is averaged out. The infor- 
mation is then simply a go-between quantity that connects agents strategies dynamics and in principle it 
is disposable. 

In order to get the averaged dynamics of the strategies for all possible values of a and not just for 
a > Qfc as in the previous section, several techniques were used 1 28l 1291 1341 1491 |58 l. Here we outline 
one of them, which is by means of the Kramers-Moyal expansion | 27|. This is a general technique to 
obtain diffusion approximations of stochastic processes. Specifically, equations (|9} define an stochastic 
process than can be approximated by a drift together with a diffusion in the space of strategies' points. 
In this case, due to the fact that the random process I{t) appears twice in equation (|9jl, only the drift 
term remains non-zero in the Kramers-Moyal approximation which yields 

SMt+l)^ Sp, (t) - • I ^ u), + a J 1 . (14) 

Comparing equation MA\ with (|9} we observe that information has been averaged out in favor of an 
effective coupling between agents' strategies, as we expected. Thus, rather than allowing the strategy 
payoff valuations to be changed at each round, only the accumulated effect on a large number of mar- 
ket decisions is used to change an agent's strategy payoff valuations. We regard equation MA\ as the 
equivalent of what in the neural network literature would be called the batch version of the conventional 
on-line MG 1341. 

Equations ( I14> are the simplest version of the dynamics to reproduce the observed behavior in the 
MG for all values of a, as shown in figure|9] Since fluctuations due to the information are averaged out, 
there is a small quantitative difference in the aggregates (like cr^), but its main features are qualitatively 
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Figure 9: Left: Results of the MG using equations (I14> (squares), compared with simulations of the 
original MG (circles). Parameters used are N — 100 and s = 2. Right: Cartoon of the MG model 
after averaging out the information: feedback performed by the agents to keep track of their strategies 
performance provides an effective interaction in the system. As in Fig. ^ solid thick lines mimic the 
flow of information in the system. 



conserved. Within this approximation, the volatility is given by 

N N 



= n + 2^h,{a^{t)) - ^ J,j{(T,{t)(j.j{t)) 

i—1 i^j — l 



(15) 



where f2 = ^ 



N 



N 



■ i^|D and 



D 



6 • 



(16) 



Although there are some quantitative differences between the original MG and the effective dynam- 
ics in the strategy space given by equations (|9}, the approximations given by ( I14> reconciliates the model 
with its endogenous nature. On the other hand, note that equations ( I14> look like equations dlOt with 
the accumulated agent temporal pattern mi{t) instead of the instantaneous ai{t). In fact, equations ( I14t 
can be written in the continuum limit like 



d{5pi) _ dn[{n,}] 



dt 



da. 



(17) 



where 'H[{(Ti}] is given by equation M2\ with ai{t) instead of mi{t). This time, Ti with cr's instead of 
m's is not a Lyapunov function of MAI in general, since the gradient is with respect to a, not p. But, as 
we saw in sectional 7i is a Lyapunov function for a> ac for mi = {(Ji{t)) |28 , 58 1. 

Since equations ilAl are the simplest dynamics of the MG, we are in the position to single out the 
main features responsible for the behavior of the MG. To this end, let us rewrite equations il4\ the 
following way: 

N 

y ] JijXi{t) 



CTi{t + 1) = sign 



5k(0) 



(18) 
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where Xi(t) — j J2t=q '^i''') the time average of agent i actions overtime. If we assume that dpi{0) = 
0, equation jl8t looks like an attractor neural network fP\ (specifically a Hopfield model), where the 
patterns ^ have been learned through the anti-Hebbian rule M6\ instead of the usual Hebbian rule = 
■ and each neuron is subject to external biases Hi. In ANNs anti-Hebbian rule is studied in models 
of paramagnetic unlearning f35"511, the process by which spurious states from the space of possible 
configurations of ai{t) are removed since they do not correspond to any stored pattern^; the opposite 
sign in the anti-Hebbian rule hinders rather that assists information retrieval in ANN. In the MG, the 
origin of the anti-Hebbian rule is different: it comes through the minority rule, which forces the behavior 
of agents in the long run to be different to any other agents behavior. 
More generally we can look at systems like 



ai{t + 1) = sign 



N 



(19) 



where Xi (t) = jj X]r=o '^^ i^^''') average of agent i behavior over the last M time steps. This 

kind of time delayed ANN were considered as possible candidates of models for storing and generating 
time sequences of patterns 1411 1461 . In the MG, the time delayed dynamics is not set up to retrieve or 
create any particular time pattern but rather comes from the inductive reasoning process, which keeps 
record of the performance of the strategies along the game. 

Thus, equations ( I19> define a new type of ANN whose main ingredients are: 

• D heterogenous patterns are stored through the anti-Hebbian rule il6\ due to the minority rule, 

• each neuron reacts to an external field given by fli, and 

• dynamics is time delayed over the last M steps because of the adaptation process. 



Some biologists have suggested that this unlearning procedure correspond to REM sleep, which is widely observed among 
mammals 1351 . 
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which are necessary to reproduced the observed behavior of the MG. Finally one may ask whether is it 
relevant to keep the whole historic performance of each strategy, i.e., Af — > cx) or just the last M < oo 
time steps^. In figure^] we see that if only the last AI strategy performances are kept with M small, 
then the system never attains better result than the random case. Actually the cases M — 1,2 reproduces 
quite accurately the high volatility region a -C etc which means that in this phase, agents react only to 
the last winning group. For a > ac keeping the record of strategies over large time periods M ^ D is 
the only way for the agents to achieve a better-than-random solution to the problem. 

Finally, it is appealing to rewrite equations jl8t like a learning process |1 1. After all, inductive 
reasoning in the MG is the way agents learn how to act in the game. If the time dependence in ( I19> 
is considered as a learning process or training of a set of N interacting perceptrons, then the learning 
process is given by 

a,{t + l) = sign[-r!, + ^ Jy(t)a,] (20) 
J,j{t) = -l-J,^a,{t + l)<jj{t) + j^J,, (21) 

This type of interacting N perceptrons (one for each agent) have been considered recently in the context 
of the MG 1 40 1 , although with a different learning rule for the perceptrons and with homogeneous agents 
(i.e. — ^j,uji — LOj). However, as we can see in (12 1> . the MG has its unique learning process which 
comes through adaptation of agents along iteration of the game. 

In summary, after clearing up the MG model we ended up with a set of heterogenous agents (or 
perceptrons) which are learning through the adaptation process M\\ the minority task given by (I20> . It 
would be interesting to extend the training of N perceptrons given by \2\\ and to consider the general 
problem of learning in a set of interacting N perceptrons under the basic MG features: heterogeneity, 
minority rule and time delayed interactions. 

9 Dynamics matter 

The success of the solution obtained using the replica trick II15II47I and outlined in sectional established 
the concept that inductive reasoning of N interacting agents in the MG is described by the minimization 
of a global function, the response Ti. Thus, the MG seems to be similar to an equilibrium problem of 
disordered systems. But a simple numerical experiment shows that this is not the case |28 1: Suppose 
that agents are confident about one of their strategies and they start by giving it a period of grace, i.e. 
an initial number of points different from zero. This means that pf (0) ^ for some a in equation (|2j. 
Simulations in figureE]show that this tiny modification of the MG model has a profound impact in the 
results for a < ac- Specifically, the high volatility region is lost and a region in which volatility is very 
small emerges. Moreover, the steady state volatility for a < ac depends on the initial condition p"(0) 
|22||28||34|l 

The fact that the steady state reached by agents through adaptation depends on the initial condition 
for a < ac means that this region is non-ergodic and that the observed change of behavior at a = ac is 
due to a non-equilibrium phase transition instead of an equilibrium transition. A precise mathematical 
description of this was reached through the generating functional methods of De Dominicis |24| by 
A. C. C. Coolen and coworkers |20 21 , 22, 34 1. This technique allows to carry out the average over 

'a similar model (named time horizon MG), was considered in [33 in which, in the original MG model, 'pf{t) are just the 
points of strategy a of agent i for the last M time steps. 

^'Due to the dependence on the initial condition for a < ac, experiments in which the number of agents N or the memory D 
are changed quasi-statically lead to hysteresis behavior in the MG 1581 . again demonstrating the non-equilibrium behavior in the 
region a < ac 
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Figure 1 1 : Simulations of the original MG model (with endogenous information) for different initial 
conditions and m — 7, s — 2 andp^(O) = (circles) andp^(O) = 100 (squares). Upper panel: dashed 
lines correspond to the stationary solution for a > etc and a^/N a^/^ for a < Uc- Lower panel: full 
symbols correspond to R while open symbols refer to 0. 



the quenched disorder in the system into the dynamical equations directly, instead of performing it in 
the equilibrium partition function. The outcome of this averaging is a set of closed equations for the 
correlation C(t, i') and response t') functions of the variables involved in the dynamical equations 
[pf (t) and (Ji{t)\ at different times. This system of dynamical equations for C{t,t') and R(t,t') is 
difficult to solve in general, but some analytical progress can be made if one looks at two particular 
situations: 

• For instance, one can investigate asymptotic stationary solutions, i.e. when C{t, t) = C{t — t') 
and R{t,t') = R{t — t'). and assume that the system has not anomalous response, i.e. that 
fluctuations around the stationary state are damped in finite time and the susceptibility is finite. 
In this case, the results obtained in section through the evaluation of the minima of Ti, are 
recovered. Moreover, this stationary solution predicts self-consistently when it breaks down: the 
susceptibility of the solution diverges at a = etc and the assumption of stationary in the solution 
is only valid for a > a^- 

• For a < OLc one can investigate the first time steps of the correlation and response function 
dynamics. It is found that the volatility depends strongly on the initial conditions: for s = 2 and 
depending on (5pi(0) a transition from high volatility jN ~ a^^ to low volatility ct^/A^ ~ a^/^ 
is found (see Fig. II It . 

These results point out that the phase transition found in the MG is a non-equilibrium transition and 
that only for a > the adaptation dynamics is relaxing and ergodic: for any initial condition a final 
unique steady state is reached which is described by the minima of 7i. For a < etc. the system is non- 
ergodic, since the system asymptotic behavior depends strongly on the initial conditions. After all, the 
MG is posed without requiring detailed balance or any kind of Fluctuation-Dissipation theorem about 
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its dynamics, so there is in principle no reason to expect that the MG should have the usual relaxing 
and ergodic dynamics of other models in statistical mechanics. Finally, it is interesting to note |22| than 
ergodicity in the MG is not broken at the macroscopic level as in replica symmetry breaking solutions in 
spin-glass; it is done at the microscopic level since the dynamical rules do not fulfill any type of detailed 
balance. 

The generating functional analysis proved to be the right tool to study the MG since its focus is 
on the dynamics of disordered systems, specially on spin glasses. Since the existence of a partition 
function is not needed, complicated cases like steady states with non-equilibrium fluctuation-dissipation 
relationships or violations of detailed balance can be tackled within this formalism. Indeed, the MG 
model features do not root in physical considerations so there is no reason a priori to expect detailed 
balance or fluctuation-dissipation relationships. 

10 Outlook 

The land of statistical physics is broad, with many dales, hills, valleys and peaks 
to explore that are of relevance to the real world and to our ways of thinking about 
it. 

Michael E. Fisher in Rev. Mod. P/iy.s. 70, 653 (1998). 

Through this journey we have learned that adaptation or inductive reasoning as that of the MG model 
leads to a system of many interacting and heterogeneous degrees of freedom whose dynamics is not 
local in time. The system may or may not be ergodic and, generally speaking, equilibrium statistical 
mechanics should be abandoned. However there are some similarities with statistical mechanics of 
disordered systems and some of the tools and concepts of disordered system do apply to the MG. The 
MG is just a model of adaptation of N agents which can be generaUzed to other types of learning as we 
saw in section|8] For those types of dynamical problems, the suitable tool from statistical physics is the 
generating functional approach we mentioned in section|9]which gives not only information about the 
properties of the system when a steady state is reached, but also about the dynamical nature of the phase 
transition. In fact, the generating function approach is also useful in different modifications of the MG 
[20 21 22 26J. 

Adaptation turned out to be a good way to solve the MG problem for a > Uc- But it might not be 
the best way to do it in the region a < etc- As a matter of fact, simple modifications of the original MG 

model like taking a non-zero initial condition or what is called the Thermal Minority Game 17. .8 18J , 

in which agent's decisions are taken with some degree of stochasticity (instead of using their best one 
always), lead to better solutions {a'^ /N ^ 1) of the MG problem than the original model. 

However, part of the success of the application of analytical techniques in the MG is due to its fixed 
rules. For example, in real life we do not expect to come across a game in which players have the same 
space of strategies, or in which the initial condition for all of them is that in which none of their strategies 
is preferred. Relaxing some of these rules in the MG does not modify its behavior; unfortunately, others 
do. During the last years we have witnessed a burst of spin-off models whose core is given by the MG 
fl 11. Of particular importance are those in which refinements of the model were towards the description 
of real markets 1 19 30 31 48 1 or real problems in ecology 1 10 1. The validity of the MG as a behavioral 
model of agents trying to spot patterns in aggregate outcomes has been even tested through experiments 
with people ll4l l52l showing that indeed people tend to coordinate despite its selfish nature in the game. 

On the other hand, there has been some extensions of the MG which has been used in the context of 
time series analysis and prediction by doing reverse engineering: since the MG is, as we saw in section 
|8] a precise learning rule, why not training a set of N agents with a real life time series? In that case, 
agents strategies are not fixed at the beginning of the game but rather, they are chosen to reproduced 
the given time series. This has been applied to financial times series in order to forecast future price 
movements f39]. 
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The virtue of the MG rehes on the incorporation of the minority rule and agents' heterogeneity 
in the model as well as an agent-based approach to the solution of a given problem. In this regard, 
during the last years there has been a lot of attention in the solution of general problems using multi- 
agent models 1 59 60 1. In fact coupled layers of perceptrons are just a simple example of this approach 
1 1 1. The general principles of those multi-agent systems is that agents individually have incomplete 
information or capabilities for solving the problem but can reach a solution by interacting among them. 
The success of understanding of the MG using tools from statistical mechanics encourages to tackle 
more general heterogeneous multi-agent systems problems within the theory of disordered systems in 
condensed matter. 
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